SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Skatrud, Paul L.
               Peery, Robert B.
               de Waard, Maarten

(ii) TITLE OF INVENTION: Multiple Drug Resistance Gene atrD of Aspergillus Nidulans

(iii) NUMBER OF SEQUENCES: 3

(iv) CORRESPONDENCE ADDRESS:
    (A) ADDRESSEE: Eli Lilly and Company
    (B) STREET: Lilly Corporate Center
    (C) CITY: Indianapolis
    (D) STATE: Indiana
    (E) COUNTRY: U.S.
    (F) ZIP: 46285

(v) COMPUTER READABLE FORM:
    (A) MEDIUM TYPE: Floppy disk
    (B) COMPUTER: IBM PC compatible
    (C) OPERATING SYSTEM: PC-DOS/MS-DOS
    (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
    (A) APPLICATION NUMBER:
    (B) FILING DATE:
    (C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
    (A) NAME: Webster, Thomas D.
    (B) REGISTRATION NUMBER: 39,872
    (C) REFERENCE/DOCKET NUMBER: X-11766

(ix) TELECOMMUNICATION INFORMATION:
    (A) TELEPHONE: 317-276-3334
    (B) TELEFAX: 317-276-2763



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 4002 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:
    (A) NAME/KEY: CDS
    (B) LOCATION: 1..4002

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:


ATG TCC CCG CTA GAG ACA AAT CCC CTT TCG CCA GAG ACT GCT ATG CGC  48
Met Ser Pro Leu Glu Thr Asn Pro Leu Ser Pro Glu Thr Ala Met Arg
1 5 10 15

GAA CCT GCT GAG ACT TCA ACG ACG GAG GAG CAA GCT TCT ACA CCA CAC  96
Glu Pro Ala Glu Thr Ser Thr Thr Glu Glu Gln Ala Ser Thr Pro His
20 25 30

GCT GCG GAC GAG AAG AAA ATC CTC AGC GAC CTC TCG GCT CCA TCT AGT  144
Ala Ala Asp Glu Lys Lys Ile Leu Ser Asp Leu Ser Ala Pro Ser Ser
35 40 45

ACT ACA GCA ACC CCC GCA GAC AAG GAG CAC CGT CCT AAA TCG TCG TCC  192
Thr Thr Ala Thr Pro Ala Asp Lys Glu His Arg Pro Lys Ser Ser Ser
50 55 60

AGC AAT AAT GCG GTC TCG GTC AAC GAA GTC GAT GCG CTT ATT GCG CAC  240
Ser Asn Asn Ala Val Ser Val Asn Glu Val Asp Ala Leu Ile Ala His
65 70 75 80

CTG CCA GAA GAC GAG AGG CAG GTC TTG AAG ACG CAG CTG GAG GAG ATC  288
Leu Pro Glu Asp Glu Arg Gln Val Leu Lys Thr Gln Leu Glu Glu Ile
85 90 95

AAA GTA AAC ATC TCC TTC TTC GGT CTC TGG CGG TAT GCA ACA AAG ATG  336
Lys Val Asn Ile Ser Phe Phe Gly Leu Trp Arg Tyr Ala Thr Lys Met
100 105 110

GAT ATA CTT ATC ATG GTA ATC AGT ACA ATC TGT GCC ATT GCT GCC GCG  384
Asp Ile Leu Ile Met Val Ile Ser Thr Ile Cys Ala Ile Ala Ala Ala
115 120 125

TCG ACT TTC CAG AGG ATA ATG TTA TAT CAA ATC TCG TAC GAC GAG TTC  432
Ser Thr Phe Gln Arg Ile Met Leu Tyr Gln Ile Ser Tyr Asp Glu Phe
130 135 140

TAT GAT GAA TTG ACC AAG AAC GTA CTG TAC TTC GTA TAC CTC GGT ATC  480
Tyr Asp Glu Leu Thr Lys Asn Val Leu Tyr Phe Val Tyr Leu Gly Ile
145 150 155 160

GGC GAG TTT GTC ACT GTC TAT GTT AGT ACT GTT GGC TTC ATC TAT ACC  528
Gly Glu Phe Val Thr Val Tyr Val Ser Thr Val Gly Phe Ile Tyr Thr
165 170 175

GGA GAA CAC GCC ACG CAG AAG ATC CGC GAG TAT TAC CTT GAG TCT ATC  576
Gly Glu His Ala Thr Gln Lys Ile Arg Glu Tyr Tyr Leu Glu Ser Ile
180 185 190

CTG CGC CAG AAC ATT GGC TAT TTT GAT AAA CTC GGT GCC GGG GAA GTG  624
Leu Arg Gln Asn Ile Gly Tyr Phe Asp Lys Leu Gly Ala Gly Glu Val
195 200 205

ACC ACC CGT ATA ACA GCC GAT ACA AAC CTT ATC CAG GAT GGC ATT TCG  672
Thr Thr Arg Ile Thr Ala Asp Thr Asn Leu Ile Gln Asp Gly Ile Ser
210 215 220

GAG AAG GTC GGT CTC ACT TTG ACT GCC CTG GCG ACA TTC GTG ACA GCA  720
Glu Lys Val Gly Leu Thr Leu Thr Ala Leu Ala Thr Phe Val Thr Ala
225 230 235 240

TTC ATT ATC GCC TAC GTC AAA TAC TGG AAG TTG GCT CTA ATT TGC AGC  768
Phe Ile Ile Ala Tyr Val Lys Tyr Trp Lys Leu Ala Leu Ile Cys Ser
245 250 255

TCA ACA ATT GTG GCC CTC GTT CTC ACC ATG GGC GGT GGT TCT CAG TTT  816
Ser Thr Ile Val Ala Leu Val Leu Thr Met Gly Gly Gly Ser Gln Phe
260 265 270

ATC ATC AAG TAC AGC AAA AAG TCG CTT GAC AGC TAC GGT GCA GGC GGC  864
Ile Ile Lys Tyr Ser Lys Lys Ser Leu Asp Ser Tyr Gly Ala Gly Gly
275 280 285

ACT GTT GCG GAA GAG GTC ATC AGC TCC ATC AGA AAT GCC ACA GCG TTT  912
Thr Val Ala Glu Glu Val Ile Ser Ser Ile Arg Asn Ala Thr Ala Phe
290 295 300

GGC ACC CAA GAC AAG CTT GCG AAG CAG TAT GAG GTC CAC TTA GAC GAA  960
Gly Thr Gln Asp Lys Leu Ala Lys Gln Tyr Glu Val His Leu Asp Glu
305 310 315 320

GCT GAG AAA TGG GGA ACA AAG AAC CAG ATT GTC ATG GGT TTC ATG ATT  1008
Ala Glu Lys Trp Gly Thr Lys Asn Gln Ile Val Met Gly Phe Met Ile
325 330 335

GGC GCC ATG TTT GGC CTT ATG TAC TCG AAC TAC GGT CTT GGC TTC TGG  1056
Gly Ala Met Phe Gly Leu Met Tyr Ser Asn Tyr Gly Leu Gly Phe Trp
340 345 350

ATG GGT TCT CGT TTC CTG GTA GAT GGT GCA GTC GAT GTG GGT GAT ATT  1104
Met Gly Ser Arg Phe Leu Val Asp Gly Ala Val Asp Val Gly Asp Ile
355 360 365

CTC ACA GTT CTC ATG GCC ATC TTG ATC GGA TCG TTC TCC TTG GGG AAC  1152
Leu Thr Val Leu Met Ala Ile Leu Ile Gly Ser Phe Ser Leu Gly Asn
370 375 380

GTT AGT CCA AAT GCT CAA GCA TTT ACA AAC GCT GTG GCC GCG GCC GCA  1200
Val Ser Pro Asn Ala Gln Ala Phe Thr Asn Ala Val Ala Ala Ala Ala
385 390 395 400

AAG ATA TTT GGA ACG ATC GAT CGC CAG TCC CCA TTA GAT CCA TAT TCG  1248
Lys Ile Phe Gly Thr Ile Asp Arg Gln Ser Pro Leu Asp Pro Tyr Ser
405 410 415

AAC GAA GGG AAG ACG CTC GAC CAT TTT GAG GGC CAC ATT GAG TTA CGC  1296
Asn Glu Gly Lys Thr Leu Asp His Phe Glu Gly His Ile Glu Leu Arg
420 425 430

AAT GTC AAG CAT ATT TAC CCA TCT AGA CCC GAG GTC ACC GTC ATG GAG  1344
Asn Val Lys His Ile Tyr Pro Ser Arg Pro Glu Val Thr Val Met Glu
435 440 445

GAT GTT TCT CTG TCA ATG CCC GCT GGA AAA ACA ACC GCT TTA GTC GGC  1392
Asp Val Ser Leu Ser Met Pro Ala Gly Lys Thr Thr Ala Leu Val Gly
450 455 460

CCC TCT GGC TCT GGA AAA AGT ACG GTG GTC GGC TTG GTT GAG CGA TTC  1440
Pro Ser Gly Ser Gly Lys Ser Thr Val Val Gly Leu Val Glu Arg Phe
465 470 475 480

TAC ATG CCT GTT CGC GGT ACG GTT TTG CTG GAT GGC CAT GAC ATC AAG  1488
Tyr Met Pro Val Arg Gly Thr Val Leu Leu Asp Gly His Asp Ile Lys
485 490 495

GAC CTC AAT CTC CGC TGG CTT CGC CAA CAG ATC TCT TTG GTT AGC CAG  1536
Asp Leu Asn Leu Arg Trp Leu Arg Gln Gln Ile Ser Leu Val Ser Gln
500 505 510

GAG CCT GTT CTT TTT GGC ACG ACG ATT TAT AAG AAT ATT AGG CAC GGT  1584
Glu Pro Val Leu Phe Gly Thr Thr Ile Tyr Lys Asn Ile Arg His Gly
515 520 525

CTC ATC GGC ACA AAG TAC GAG AAT GAA TCC GAG GAT AAG GTC CGG GAA  1632
Leu Ile Gly Thr Lys Tyr Glu Asn Glu Ser Glu Asp Lys Val Arg Glu
530 535 540

CTC ATC GAG AAC GCG GCA AAA ATG GCG AAT GCT CAT GAC TTT ATT ACT  1680
Leu Ile Glu Asn Ala Ala Lys Met Ala Asn Ala His Asp Phe Ile Thr
545 550 555 560

GCC TTG CCT GAA GGT TAT GAG ACC AAT GTT GGG CAG CGT GGC TTT CTC  1728
Ala Leu Pro Glu Gly Tyr Glu Thr Asn Val Gly Gln Arg Gly Phe Leu
565 570 575

CTT TCA GGT GGC CAG AAA CAG CGC ATT GCA ATC GCC CGT GCC GTT GTT  1776
Leu Ser Gly Gly Gln Lys Gln Arg Ile Ala Ile Ala Arg Ala Val Val
580 585 590

AGT GAC CCA AAA ATC CTG CTC CTG GAT GAA GCT ACT TCG GCC TTG GAC  1824
Ser Asp Pro Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp
595 600 605

ACA AAA TCC GAA GGC GTG GTT CAA GCA GCT TTG GAG AGG GCA GCT GAA  1872
Thr Lys Ser Glu Gly Val Val Gln Ala Ala Leu Glu Arg Ala Ala Glu
610 615 620

GGC CGA ACT ACT ATT GTG ATC GCT CAT CGC CTT TCC ACG ATC AAA ACG  1920
Gly Arg Thr Thr Ile Val Ile Ala His Arg Leu Ser Thr Ile Lys Thr
625 630 635 640

GCG CAC AAC ATT GTG GTT CTG GTC AAT GGC AAA ATT GCT GAA CAA GGA  1968
Ala His Asn Ile Val Val Leu Val Asn Gly Lys Ile Ala Glu Gln Gly
645 650 655

ACT CAC GAT GAA TTG GTT GAC CGC GGA GGC GCT TAT CGC AAA CTT GTG  2016
Thr His Asp Glu Leu Val Asp Arg Gly Gly Ala Tyr Arg Lys Leu Val
660 665 670

GAG GCT CAA CGT ATC AAT GAA CAG AAG GAA GCT GAC GCC TTG GAG GAC  2064
Glu Ala Gln Arg Ile Asn Glu Gln Lys Glu Ala Asp Ala Leu Glu Asp
675 680 685

GCC GAC GCT GAG GAT CTC ACG AAT GCA GAT ATT GCC AAA ATC AAA ACT  2112
Ala Asp Ala Glu Asp Leu Thr Asn Ala Asp Ile Ala Lys Ile Lys Thr
690 695 700

GCG TCA AGC GCA TCA TCC GAT CTC GAC GGA AAA CCC ACA ACC ATT GAC  2160
Ala Ser Ser Ala Ser Ser Asp Leu Asp Gly Lys Pro Thr Thr Ile Asp
705 710 715 720

CGC ACG GGC ACC CAC AAG TCT GTT TCC AGC GCG ATT CTT TCT AAA AGA  2208
Arg Thr Gly Thr His Lys Ser Val Ser Ser Ala Ile Leu Ser Lys Arg
725 730 735

CCC CCC GAA ACA ACT CCG AAA TAC TCA TTA TGG ACG CTG CTC AAA TTT  2256
Pro Pro Glu Thr Thr Pro Lys Tyr Ser Leu Trp Thr Leu Leu Lys Phe
740 745 750

GTT GCT TCC TTC AAC CGC CCT GAA ATC CCG TAC ATG CTC ATC GGT CTT  2304
Val Ala Ser Phe Asn Arg Pro Glu Ile Pro Tyr Met Leu Ile Gly Leu
755 760 765

GTC TTC TCA GTG TTA GCT GGT GGT GGC CAA CCC ACG CAA GCA GTG CTA  2352
Val Phe Ser Val Leu Ala Gly Gly Gly Gln Pro Thr Gln Ala Val Leu
770 775 780

TAT GCT AAA GCC ATC AGC ACA CTC TCG CTC CCA GAA TCA CAA TAT AGC  2400
Tyr Ala Lys Ala Ile Ser Thr Leu Ser Leu Pro Glu Ser Gln Tyr Ser
785 790 795 800

AAG CTT CGA CAT GAT GCG GAT TTC TGG TCA TTG ATG TTC TTC GTG GTT  2448
Lys Leu Arg His Asp Ala Asp Phe Trp Ser Leu Met Phe Phe Val Val
805 810 815

GGT ATC ATT CAG TTT ATC ACG CAG TCA ACC AAT GGT GCT GCA TTT GCC  2496
Gly Ile Ile Gln Phe Ile Thr Gln Ser Thr Asn Gly Ala Ala Phe Ala
820 825 830

GTA TGC TCC GAG AGA CTT ATT CGT CGC GCG AGA AGC ACT GCC TTT CGG  2544
Val Cys Ser Glu Arg Leu Ile Arg Arg Ala Arg Ser Thr Ala Phe Arg
835 840 845

ACG ATA CTC CGT CAA GAC ATT GCT TTC TTT GAC AAG GAA GAG AAT AGC  2592
Thr Ile Leu Arg Gln Asp Ile Ala Phe Phe Asp Lys Glu Glu Asn Ser
850 855 860

ACC GGC GCT CTG ACC TCT TTC CTG TCC ACC GAG ACG AAG CAT CTC TCC  2640
Thr Gly Ala Leu Thr Ser Phe Leu Ser Thr Glu Thr Lys His Leu Ser
865 870 875 880

GGT GTT AGC GGT GTG ACT CTA GGC ACG ATC TTG ATG ACC TCC ACG ACC  2688
Gly Val Ser Gly Val Thr Leu Gly Thr Ile Leu Met Thr Ser Thr Thr
885 890 895

CTA GGA GCG GCT ATC ATT ATT GCC CTG GCG ATT GGG TGG AAA TTG GCC  2736
Leu Gly Ala Ala Ile Ile Ile Ala Leu Ala Ile Gly Trp Lys Leu Ala
900 905 910

TTA GTT TGT ATC TCG GTT GTG CCG GTT CTC CTG GCA TGC GGT TTC TAC  2784
Leu Val Cys Ile Ser Val Val Pro Val Leu Leu Ala Cys Gly Phe Tyr
915 920 925

CGA TTC TAT ATG CTA GCC CAG TTT CAA TCA CGC TCC AAG CTT GCT TAT  2832
Arg Phe Tyr Met Leu Ala Gln Phe Gln Ser Arg Ser Lys Leu Ala Tyr
930 935 940

GAG GGA TCT GCA AAC TTT GCT TGC GAG GCT ACA TCG TCT ATC CGC ACA  2880
Glu Gly Ser Ala Asn Phe Ala Cys Glu Ala Thr Ser Ser Ile Arg Thr
945 950 955 960

GTT GCG TCA TTA ACC CGG GAA AGG GAT GTC TGG GAG ATT TAC CAT GCC  2928
Val Ala Ser Leu Thr Arg Glu Arg Asp Val Trp Glu Ile Tyr His Ala
965 970 975

CAG CTT GAC GCA CAA GGC AGG ACC AGT CTA ATC TCT GTC TTG AGG TCA  2976
Gln Leu Asp Ala Gln Gly Arg Thr Ser Leu Ile Ser Val Leu Arg Ser
980 985 990

TCC CTG TTA TAT GCG TCG TCG CAG GCA CTT GTT TTC TTC TGC GTT GCG  3024
Ser Leu Leu Tyr Ala Ser Ser Gln Ala Leu Val Phe Phe Cys Val Ala
995 1000 1005

CTC GGG TTT TGG TAC GGA GGG ACA CTT CTT GGT CAC CAC GAG TAT GAC  3072
Leu Gly Phe Trp Tyr Gly Gly Thr Leu Leu Gly His His Glu Tyr Asp
1010 1015 1020

ATT TTC CGC TTC TTT GTT TGT TTC TCC GAG ATT CTC TTT GGT GCT CAA  3120
Ile Phe Arg Phe Phe Val Cys Phe Ser Glu Ile Leu Phe Gly Ala Gln
1025 1030 1035 1040

TCC GCG GGC ACC GTC TTT TCC TTT GCA CCA GAC ATG GGC AAG GCG AAG  3168
Ser Ala Gly Thr Val Phe Ser Phe Ala Pro Asp Met Gly Lys Ala Lys
1045 1050 1055

AAT GCG GCC GCC GAA TTC CGA CGA CTG TTC GAC CGA AAG CCA CAA ATT  3216
Asn Ala Ala Ala Glu Phe Arg Arg Leu Phe Asp Arg Lys Pro Gln Ile
1060 1065 1070

GAT AAC TGG TCT GAA GAG GGC GAG AAG CTC GAA ACG GTG GAA GGT GAA  3264
Asp Asn Trp Ser Glu Glu Gly Glu Lys Leu Glu Thr Val Glu Gly Glu
1075 1080 1085

ATC GAA TTT AGG AAC GTG CAC TTC AGA TAC CCG ACC CGC CCA GAA CAG  3312
Ile Glu Phe Arg Asn Val His Phe Arg Tyr Pro Thr Arg Pro Glu Gln
1090 1095 1100

CCT GTC CTG CGC GGC TTG GAC CTG ACC GTG AAG CCT GGA CAA TAT GTT  3360
Pro Val Leu Arg Gly Leu Asp Leu Thr Val Lys Pro Gly Gln Tyr Val
1105 1110 1115 1120

GCG CTT GTC GGA CCC AGC GGT TGT GGC AAG AGT ACC ACC ATT GCA TTG  3408
Ala Leu Val Gly Pro Ser Gly Cys Gly Lys Ser Thr Thr Ile Ala Leu
1125 1130 1135

CTT GAG CGC TTT TAC GAT GCG ATT GCC GGG TCC ATC CTT GTT GAT GGG  3456
Leu Glu Arg Phe Tyr Asp Ala Ile Ala Gly Ser Ile Leu Val Asp Gly
1140 1145 1150

AAG GAC ATA AGT AAA CTA AAT ATC AAC TCC TAC CGC AGC TTT CTG TCA  3504
Lys Asp Ile Ser Lys Leu Asn Ile Asn Ser Tyr Arg Ser Phe Leu Ser
1155 1160 1165

CTG GTC AGC CAG GAG CCG ACA CTG TAC CAG GGC ACC ATC AAG GAA AAC  3552
Leu Val Ser Gln Glu Pro Thr Leu Tyr Gln Gly Thr Ile Lys Glu Asn
1170 1175 1180

ATC TTA CTT GGT ATT GTC GAA GAT GAC GTA CCG GAA GAA TTC TTG ATT  3600
Ile Leu Leu Gly Ile Val Glu Asp Asp Val Pro Glu Glu Phe Leu Ile
1185 1190 1195 1200

AAG GCT TGC AAG GAC GCT AAT ATC TAC GAC TTC ATC ATG TCG CTC CCG  3648
Lys Ala Cys Lys Asp Ala Asn Ile Tyr Asp Phe Ile Met Ser Leu Pro
1205 1210 1215

GAG GGC TTT AAT ACA GTT GTT GGC AGC AAG GGA GGC ATG TTG TCT GGC  3696
Glu Gly Phe Asn Thr Val Val Gly Ser Lys Gly Gly Met Leu Ser Gly
1220 1225 1230

GGC CAA AAG CAA CGT GTG GCC ATT GCC CGA GCC CTT CTT CGG GAT CCC  3744
Gly Gln Lys Gln Arg Val Ala Ile Ala Arg Ala Leu Leu Arg Asp Pro
1235 1240 1245

AAA ATC CTT CTT CTC GAT GAA GCG ACG TCA GCC CTC GAC TCC GAG TCA  3792
Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ser Glu Ser
1250 1255 1260

GAA AAG GTC GTC CAG GCG GCT TTG GAT GCC GCT GCC CGA GGC CGA ACC  3840
Glu Lys Val Val Gln Ala Ala Leu Asp Ala Ala Ala Arg Gly Arg Thr
1265 1270 1275 1280

ACA ATC GCC GTT GCA CAC CGA CTC AGC ACG ATT CAA AAG GCG GAC GTT  3888
Thr Ile Ala Val Ala His Arg Leu Ser Thr Ile Gln Lys Ala Asp Val
1285 1290 1295



ATC TAT GTT TTC GAC CAA GGC AAG ATC GTC GAA AGC GGA ACG CAC AGC  3936
Ile Tyr Val Phe Asp Gln Gly Lys Ile Val Glu Ser Gly Thr His Ser
1300 1305 1310

GAA CTG GTC CAG AAA AAG GGC CGG TAC TAC GAG CTG GTC AAC TTG CAG  3984
Glu Leu Val Gln Lys Lys Gly Arg Tyr Tyr Glu Leu Val Asn Leu Gln
1315 1320 1325

AGC TTG GGC AAG GGC CAT  4002
Ser Leu Gly Lys Gly His
1330
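
As an illustrative cross-check, not part of the filed listing, the sketch below translates the first row of SEQ ID NO:1 and confirms that a 4002-nucleotide CDS containing no stop codon corresponds to 1334 residues, the length stated for SEQ ID NO:2. The codon table is deliberately partial and covers only the codons in that first row; the names `translate` and `residues_encoded` are hypothetical helpers introduced here.

```python
# Partial codon table (assumption: only the codons found in the first
# row of SEQ ID NO:1 are included; this is not a full genetic code).
CODON_TO_AA = {
    "ATG": "Met", "TCC": "Ser", "CCG": "Pro", "CTA": "Leu",
    "GAG": "Glu", "ACA": "Thr", "AAT": "Asn", "CCC": "Pro",
    "CTT": "Leu", "TCG": "Ser", "CCA": "Pro", "ACT": "Thr",
    "GCT": "Ala", "CGC": "Arg",
}

# First 48 nucleotides of SEQ ID NO:1, as printed in the listing.
FIRST_ROW = "ATG TCC CCG CTA GAG ACA AAT CCC CTT TCG CCA GAG ACT GCT ATG CGC"


def translate(codons: str) -> str:
    """Translate space-separated DNA codons into three-letter residue codes."""
    return " ".join(CODON_TO_AA[codon] for codon in codons.split())


def residues_encoded(cds_length_nt: int) -> int:
    """Residues encoded by a CDS of the given length with no stop codon."""
    if cds_length_nt % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return cds_length_nt // 3


print(translate(FIRST_ROW))
print(residues_encoded(4002))
```

The same approach extends to the whole sequence if the table is replaced with the complete standard genetic code.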

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1334 amino acids
    (B) TYPE: amino acid
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Ser Pro Leu Glu Thr Asn Pro Leu Ser Pro Glu Thr Ala Met Arg
1 5 10 15

Glu Pro Ala Glu Thr Ser Thr Thr Glu Glu Gln Ala Ser Thr Pro His
20 25 30

Ala Ala Asp Glu Lys Lys Ile Leu Ser Asp Leu Ser Ala Pro Ser Ser
35 40 45

Thr Thr Ala Thr Pro Ala Asp Lys Glu His Arg Pro Lys Ser Ser Ser
50 55 60

Ser Asn Asn Ala Val Ser Val Asn Glu Val Asp Ala Leu Ile Ala His
65 70 75 80

Leu Pro Glu Asp Glu Arg Gln Val Leu Lys Thr Gln Leu Glu Glu Ile
85 90 95

Lys Val Asn Ile Ser Phe Phe Gly Leu Trp Arg Tyr Ala Thr Lys Met
100 105 110

Asp Ile Leu Ile Met Val Ile Ser Thr Ile Cys Ala Ile Ala Ala Ala
115 120 125

Ser Thr Phe Gln Arg Ile Met Leu Tyr Gln Ile Ser Tyr Asp Glu Phe
130 135 140

Tyr Asp Glu Leu Thr Lys Asn Val Leu Tyr Phe Val Tyr Leu Gly Ile
145 150 155 160

Gly Glu Phe Val Thr Val Tyr Val Ser Thr Val Gly Phe Ile Tyr Thr
165 170 175

Gly Glu His Ala Thr Gln Lys Ile Arg Glu Tyr Tyr Leu Glu Ser Ile
180 185 190

Leu Arg Gln Asn Ile Gly Tyr Phe Asp Lys Leu Gly Ala Gly Glu Val
195 200 205

Thr Thr Arg Ile Thr Ala Asp Thr Asn Leu Ile Gln Asp Gly Ile Ser
210 215 220

Glu Lys Val Gly Leu Thr Leu Thr Ala Leu Ala Thr Phe Val Thr Ala
225 230 235 240

Phe Ile Ile Ala Tyr Val Lys Tyr Trp Lys Leu Ala Leu Ile Cys Ser
245 250 255

Ser Thr Ile Val Ala Leu Val Leu Thr Met Gly Gly Gly Ser Gln Phe
260 265 270

Ile Ile Lys Tyr Ser Lys Lys Ser Leu Asp Ser Tyr Gly Ala Gly Gly
275 280 285

Thr Val Ala Glu Glu Val Ile Ser Ser Ile Arg Asn Ala Thr Ala Phe
290 295 300

Gly Thr Gln Asp Lys Leu Ala Lys Gln Tyr Glu Val His Leu Asp Glu
305 310 315 320

Ala Glu Lys Trp Gly Thr Lys Asn Gln Ile Val Met Gly Phe Met Ile
325 330 335

Gly Ala Met Phe Gly Leu Met Tyr Ser Asn Tyr Gly Leu Gly Phe Trp
340 345 350

Met Gly Ser Arg Phe Leu Val Asp Gly Ala Val Asp Val Gly Asp Ile
355 360 365

Leu Thr Val Leu Met Ala Ile Leu Ile Gly Ser Phe Ser Leu Gly Asn
370 375 380

Val Ser Pro Asn Ala Gln Ala Phe Thr Asn Ala Val Ala Ala Ala Ala
385 390 395 400

Lys Ile Phe Gly Thr Ile Asp Arg Gln Ser Pro Leu Asp Pro Tyr Ser
405 410 415

Asn Glu Gly Lys Thr Leu Asp His Phe Glu Gly His Ile Glu Leu Arg
420 425 430

Asn Val Lys His Ile Tyr Pro Ser Arg Pro Glu Val Thr Val Met Glu
435 440 445

Asp Val Ser Leu Ser Met Pro Ala Gly Lys Thr Thr Ala Leu Val Gly
450 455 460

Pro Ser Gly Ser Gly Lys Ser Thr Val Val Gly Leu Val Glu Arg Phe
465 470 475 480

Tyr Met Pro Val Arg Gly Thr Val Leu Leu Asp Gly His Asp Ile Lys
485 490 495

Asp Leu Asn Leu Arg Trp Leu Arg Gln Gln Ile Ser Leu Val Ser Gln
500 505 510

Glu Pro Val Leu Phe Gly Thr Thr Ile Tyr Lys Asn Ile Arg His Gly
515 520 525

Leu Ile Gly Thr Lys Tyr Glu Asn Glu Ser Glu Asp Lys Val Arg Glu
530 535 540

Leu Ile Glu Asn Ala Ala Lys Met Ala Asn Ala His Asp Phe Ile Thr
545 550 555 560

Ala Leu Pro Glu Gly Tyr Glu Thr Asn Val Gly Gln Arg Gly Phe Leu
565 570 575

Leu Ser Gly Gly Gln Lys Gln Arg Ile Ala Ile Ala Arg Ala Val Val
580 585 590

Ser Asp Pro Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp
595 600 605

Thr Lys Ser Glu Gly Val Val Gln Ala Ala Leu Glu Arg Ala Ala Glu
610 615 620

Gly Arg Thr Thr Ile Val Ile Ala His Arg Leu Ser Thr Ile Lys Thr
625 630 635 640

Ala His Asn Ile Val Val Leu Val Asn Gly Lys Ile Ala Glu Gln Gly
645 650 655

Thr His Asp Glu Leu Val Asp Arg Gly Gly Ala Tyr Arg Lys Leu Val
660 665 670

Glu Ala Gln Arg Ile Asn Glu Gln Lys Glu Ala Asp Ala Leu Glu Asp
675 680 685

Ala Asp Ala Glu Asp Leu Thr Asn Ala Asp Ile Ala Lys Ile Lys Thr
690 695 700

Ala Ser Ser Ala Ser Ser Asp Leu Asp Gly Lys Pro Thr Thr Ile Asp
705 710 715 720

Arg Thr Gly Thr His Lys Ser Val Ser Ser Ala Ile Leu Ser Lys Arg
725 730 735

Pro Pro Glu Thr Thr Pro Lys Tyr Ser Leu Trp Thr Leu Leu Lys Phe
740 745 750

Val Ala Ser Phe Asn Arg Pro Glu Ile Pro Tyr Met Leu Ile Gly Leu
755 760 765

Val Phe Ser Val Leu Ala Gly Gly Gly Gln Pro Thr Gln Ala Val Leu
770 775 780

Tyr Ala Lys Ala Ile Ser Thr Leu Ser Leu Pro Glu Ser Gln Tyr Ser
785 790 795 800

Lys Leu Arg His Asp Ala Asp Phe Trp Ser Leu Met Phe Phe Val Val
805 810 815

Gly Ile Ile Gln Phe Ile Thr Gln Ser Thr Asn Gly Ala Ala Phe Ala
820 825 830

Val Cys Ser Glu Arg Leu Ile Arg Arg Ala Arg Ser Thr Ala Phe Arg
835 840 845

Thr Ile Leu Arg Gln Asp Ile Ala Phe Phe Asp Lys Glu Glu Asn Ser
850 855 860

Thr Gly Ala Leu Thr Ser Phe Leu Ser Thr Glu Thr Lys His Leu Ser
865 870 875 880

Gly Val Ser Gly Val Thr Leu Gly Thr Ile Leu Met Thr Ser Thr Thr
885 890 895

Leu Gly Ala Ala Ile Ile Ile Ala Leu Ala Ile Gly Trp Lys Leu Ala
900 905 910

Leu Val Cys Ile Ser Val Val Pro Val Leu Leu Ala Cys Gly Phe Tyr
915 920 925

Arg Phe Tyr Met Leu Ala Gln Phe Gln Ser Arg Ser Lys Leu Ala Tyr
930 935 940

Glu Gly Ser Ala Asn Phe Ala Cys Glu Ala Thr Ser Ser Ile Arg Thr
945 950 955 960

Val Ala Ser Leu Thr Arg Glu Arg Asp Val Trp Glu Ile Tyr His Ala
965 970 975

Gln Leu Asp Ala Gln Gly Arg Thr Ser Leu Ile Ser Val Leu Arg Ser
980 985 990

Ser Leu Leu Tyr Ala Ser Ser Gln Ala Leu Val Phe Phe Cys Val Ala
995 1000 1005

Leu Gly Phe Trp Tyr Gly Gly Thr Leu Leu Gly His His Glu Tyr Asp
1010 1015 1020

Ile Phe Arg Phe Phe Val Cys Phe Ser Glu Ile Leu Phe Gly Ala Gln
1025 1030 1035 1040

Ser Ala Gly Thr Val Phe Ser Phe Ala Pro Asp Met Gly Lys Ala Lys
1045 1050 1055

Asn Ala Ala Ala Glu Phe Arg Arg Leu Phe Asp Arg Lys Pro Gln Ile
1060 1065 1070

Asp Asn Trp Ser Glu Glu Gly Glu Lys Leu Glu Thr Val Glu Gly Glu
1075 1080 1085

Ile Glu Phe Arg Asn Val His Phe Arg Tyr Pro Thr Arg Pro Glu Gln
1090 1095 1100

Pro Val Leu Arg Gly Leu Asp Leu Thr Val Lys Pro Gly Gln Tyr Val
1105 1110 1115 1120

Ala Leu Val Gly Pro Ser Gly Cys Gly Lys Ser Thr Thr Ile Ala Leu
1125 1130 1135

Leu Glu Arg Phe Tyr Asp Ala Ile Ala Gly Ser Ile Leu Val Asp Gly
1140 1145 1150

Lys Asp Ile Ser Lys Leu Asn Ile Asn Ser Tyr Arg Ser Phe Leu Ser
1155 1160 1165

Leu Val Ser Gln Glu Pro Thr Leu Tyr Gln Gly Thr Ile Lys Glu Asn
1170 1175 1180

Ile Leu Leu Gly Ile Val Glu Asp Asp Val Pro Glu Glu Phe Leu Ile
1185 1190 1195 1200

Lys Ala Cys Lys Asp Ala Asn Ile Tyr Asp Phe Ile Met Ser Leu Pro
1205 1210 1215

Glu Gly Phe Asn Thr Val Val Gly Ser Lys Gly Gly Met Leu Ser Gly
1220 1225 1230

Gly Gln Lys Gln Arg Val Ala Ile Ala Arg Ala Leu Leu Arg Asp Pro
1235 1240 1245

Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ser Glu Ser
1250 1255 1260



Glu Lys Val Val Gln Ala Ala Leu Asp Ala Ala Ala Arg Gly Arg Thr
1265 1270 1275 1280

Thr Ile Ala Val Ala His Arg Leu Ser Thr Ile Gln Lys Ala Asp Val
1285 1290 1295

Ile Tyr Val Phe Asp Gln Gly Lys Ile Val Glu Ser Gly Thr His Ser
1300 1305 1310

Glu Leu Val Gln Lys Lys Gly Arg Tyr Tyr Glu Leu Val Asn Leu Gln
1315 1320 1325

Ser Leu Gly Lys Gly His
1330

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 4002 base pairs
    (B) TYPE: nucleic acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:



AUGUCCCCGC UAGAGACAAA UCCCCUUUCG CCAGAGACUG CUAUGCGCGA ACCUGCUGAG  60
ACUUCAACGA CGGAGGAGCA AGCUUCUACA CCACACGCUG CGGACGAGAA GAAAAUCCUC  120
AGCGACCUCU CGGCUCCAUC UAGUACUACA GCAACCCCCG CAGACAAGGA GCACCGUCCU  180
AAAUCGUCGU CCAGCAAUAA UGCGGUCUCG GUCAACGAAG UCGAUGCGCU UAUUGCGCAC  240
CUGCCAGAAG ACGAGAGGCA GGUCUUGAAG ACGCAGCUGG AGGAGAUCAA AGUAAACAUC  300
UCCUUCUUCG GUCUCUGGCG GUAUGCAACA AAGAUGGAUA UACUUAUCAU GGUAAUCAGU  360
ACAAUCUGUG CCAUUGCUGC CGCGUCGACU UUCCAGAGGA UAAUGUUAUA UCAAAUCUCG  420
UACGACGAGU UCUAUGAUGA AUUGACCAAG AACGUACUGU ACUUCGUAUA CCUCGGUAUC  480
GGCGAGUUUG UCACUGUCUA UGUUAGUACU GUUGGCUUCA UCUAUACCGG AGAACACGCC  540
ACGCAGAAGA UCCGCGAGUA UUACCUUGAG UCUAUCCUGC GCCAGAACAU UGGCUAUUUU  600
GAUAAACUCG GUGCCGGGGA AGUGACCACC CGUAUAACAG CCGAUACAAA CCUUAUCCAG  660
GAUGGCAUUU CGGAGAAGGU CGGUCUCACU UUGACUGCCC UGGCGACAUU CGUGACAGCA  720
UUCAUUAUCG CCUACGUCAA AUACUGGAAG UUGGCUCUAA UUUGCAGCUC AACAAUUGUG  780
GCCCUCGUUC UCACCAUGGG CGGUGGUUCU CAGUUUAUCA UCAAGUACAG CAAAAAGUCG  840
CUUGACAGCU ACGGUGCAGG CGGCACUGUU GCGGAAGAGG UCAUCAGCUC CAUCAGAAAU  900


GCCACAGCGU UUGGCACCCA AGACAAGCUU GCGAAGCAGU AUGAGGUCCA CUUAGACGAA  960
GCUGAGAAAU GGGGAACAAA GAACCAGAUU GUCAUGGGUU UCAUGAUUGG CGCCAUGUUU  1020
GGCCUUAUGU ACUCGAACUA CGGUCUUGGC UUCUGGAUGG GUUCUCGUUU CCUGGUAGAU  1080
GGUGCAGUCG AUGUGGGUGA UAUUCUCACA GUUCUCAUGG CCAUCUUGAU CGGAUCGUUC  1140
UCCUUGGGGA ACGUUAGUCC AAAUGCUCAA GCAUUUACAA ACGCUGUGGC CGCGGCCGCA  1200
AAGAUAUUUG GAACGAUCGA UCGCCAGUCC CCAUUAGAUC CAUAUUCGAA CGAAGGGAAG  1260
ACGCUCGACC AUUUUGAGGG CCACAUUGAG UUACGCAAUG UCAAGCAUAU UUACCCAUCU  1320
AGACCCGAGG UCACCGUCAU GGAGGAUGUU UCUCUGUCAA UGCCCGCUGG AAAAACAACC  1380
GCUUUAGUCG GCCCCUCUGG CUCUGGAAAA AGUACGGUGG UCGGCUUGGU UGAGCGAUUC  1440
UACAUGCCUG UUCGCGGUAC GGUUUUGCUG GAUGGCCAUG ACAUCAAGGA CCUCAAUCUC  1500
CGCUGGCUUC GCCAACAGAU CUCUUUGGUU AGCCAGGAGC CUGUUCUUUU UGGCACGACG  1560
AUUUAUAAGA AUAUUAGGCA CGGUCUCAUC GGCACAAAGU ACGAGAAUGA AUCCGAGGAU  1620
AAGGUCCGGG AACUCAUCGA GAACGCGGCA AAAAUGGCGA AUGCUCAUGA CUUUAUUACU  1680
GCCUUGCCUG AAGGUUAUGA GACCAAUGUU GGGCAGCGUG GCUUUCUCCU UUCAGGUGGC  1740
CAGAAACAGC GCAUUGCAAU CGCCCGUGCC GUUGUUAGUG ACCCAAAAAU CCUGCUCCUG  1800
GAUGAAGCUA CUUCGGCCUU GGACACAAAA UCCGAAGGCG UGGUUCAAGC AGCUUUGGAG  1860
AGGGCAGCUG AAGGCCGAAC UACUAUUGUG AUCGCUCAUC GCCUUUCCAC GAUCAAAACG  1920
GCGCACAACA UUGUGGUUCU GGUCAAUGGC AAAAUUGCUG AACAAGGAAC UCACGAUGAA  1980
UUGGUUGACC GCGGAGGCGC UUAUCGCAAA CUUGUGGAGG CUCAACGUAU CAAUGAACAG  2040
AAGGAAGCUG ACGCCUUGGA GGACGCCGAC GCUGAGGAUC UCACGAAUGC AGAUAUUGCC  2100
AAAAUCAAAA CUGCGUCAAG CGCAUCAUCC GAUCUCGACG GAAAACCCAC AACCAUUGAC  2160
CGCACGGGCA CCCACAAGUC UGUUUCCAGC GCGAUUCUUU CUAAAAGACC CCCCGAAACA  2220
ACUCCGAAAU ACUCAUUAUG GACGCUGCUC AAAUUUGUUG CUUCCUUCAA CCGCCCUGAA  2280
AUCCCGUACA UGCUCAUCGG UCUUGUCUUC UCAGUGUUAG CUGGUGGUGG CCAACCCACG  2340
CAAGCAGUGC UAUAUGCUAA AGCCAUCAGC ACACUCUCGC UCCCAGAAUC ACAAUAUAGC  2400
AAGCUUCGAC AUGAUGCGGA UUUCUGGUCA UUGAUGUUCU UCGUGGUUGG UAUCAUUCAG  2460
UUUAUCACGC AGUCAACCAA UGGUGCUGCA UUUGCCGUAU GCUCCGAGAG ACUUAUUCGU  2520
CGCGCGAGAA GCACUGCCUU UCGGACGAUA CUCCGUCAAG ACAUUGCUUU CUUUGACAAG  2580
GAAGAGAAUA GCACCGGCGC UCUGACCUCU UUCCUGUCCA CCGAGACGAA GCAUCUCUCC  2640
GGUGUUAGCG GUGUGACUCU AGGCACGAUC UUGAUGACCU CCACGACCCU AGGAGCGGCU  2700
AUCAUUAUUG CCCUGGCGAU UGGGUGGAAA UUGGCCUUAG UUUGUAUCUC GGUUGUGCCG  2760
GUUCUCCUGG CAUGCGGUUU CUACCGAUUC UAUAUGCUAG CCCAGUUUCA AUCACGCUCC  2820
AAGCUUGCUU AUGAGGGAUC UGCAAACUUU GCUUGCGAGG CUACAUCGUC UAUCCGCACA  2880
GUUGCGUCAU UAACCCGGGA AAGGGAUGUC UGGGAGAUUU ACCAUGCCCA GCUUGACGCA  2940
CAAGGCAGGA CCAGUCUAAU CUCUGUCUUG AGGUCAUCCC UGUUAUAUGC GUCGUCGCAG  3000
GCACUUGUUU UCUUCUGCGU UGCGCUCGGG UUUUGGUACG GAGGGACACU UCUUGGUCAC  3060
CACGAGUAUG ACAUUUUCCG CUUCUUUGUU UGUUUCUCCG AGAUUCUCUU UGGUGCUCAA  3120
UCCGCGGGCA CCGUCUUUUC CUUUGCACCA GACAUGGGCA AGGCGAAGAA UGCGGCCGCC  3180
GAAUUCCGAC GACUGUUCGA CCGAAAGCCA CAAAUUGAUA ACUGGUCUGA AGAGGGCGAG  3240
AAGCUCGAAA CGGUGGAAGG UGAAAUCGAA UUUAGGAACG UGCACUUCAG AUACCCGACC  3300
CGCCCAGAAC AGCCUGUCCU GCGCGGCUUG GACCUGACCG UGAAGCCUGG ACAAUAUGUU  3360
GCGCUUGUCG GACCCAGCGG UUGUGGCAAG AGUACCACCA UUGCAUUGCU UGAGCGCUUU  3420
UACGAUGCGA UUGCCGGGUC CAUCCUUGUU GAUGGGAAGG ACAUAAGUAA ACUAAAUAUC  3480
AACUCCUACC GCAGCUUUCU GUCACUGGUC AGCCAGGAGC CGACACUGUA CCAGGGCACC  3540
AUCAAGGAAA ACAUCUUACU UGGUAUUGUC GAAGAUGACG UACCGGAAGA AUUCUUGAUU  3600
AAGGCUUGCA AGGACGCUAA UAUCUACGAC UUCAUCAUGU CGCUCCCGGA GGGCUUUAAU  3660
ACAGUUGUUG GCAGCAAGGG AGGCAUGUUG UCUGGCGGCC AAAAGCAACG UGUGGCCAUU  3720
GCCCGAGCCC UUCUUCGGGA UCCCAAAAUC CUUCUUCUCG AUGAAGCGAC GUCAGCCCUC  3780
GACUCCGAGU CAGAAAAGGU CGUCCAGGCG GCUUUGGAUG CCGCUGCCCG AGGCCGAACC  3840
ACAAUCGCCG UUGCACACCG ACUCAGCACG AUUCAAAAGG CGGACGUUAU CUAUGUUUUC  3900
GACCAAGGCA AGAUCGUCGA AAGCGGAACG CACAGCGAAC UGGUCCAGAA AAAGGGCCGG  3960
UACUACGAGC UGGUCAACUU GCAGAGCUUG GGCAAGGGCC AU  4002
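
As a hedged illustration, not part of the filed listing, the sketch below shows how the mRNA-sense string of SEQ ID NO:3 relates to the coding-strand cDNA of SEQ ID NO:1: every T becomes U and the result is regrouped into blocks of ten. It uses only the first 48 nucleotides of SEQ ID NO:1; the helper names `transcribe` and `group` are hypothetical and introduced here for the example.

```python
def transcribe(dna: str) -> str:
    """Return the mRNA-sense string for a coding-strand DNA string (T -> U)."""
    return dna.upper().replace(" ", "").replace("T", "U")


def group(seq: str, block: int = 10) -> str:
    """Split a sequence into space-separated blocks, as in the listing."""
    return " ".join(seq[i:i + block] for i in range(0, len(seq), block))


# First 48 nucleotides of SEQ ID NO:1, as printed in the listing.
dna_row1 = "ATG TCC CCG CTA GAG ACA AAT CCC CTT TCG CCA GAG ACT GCT ATG CGC"

rna = transcribe(dna_row1)
print(group(rna))
# AUGUCCCCGC UAGAGACAAA UCCCCUUUCG CCAGAGACUG CUAUGCGC
```

Running the same comparison over all 4002 positions is one way to spot transcription or scanning discrepancies between the two nucleotide listings.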



